137 research outputs found

Factorizing LambdaMART for cold start recommendations

Authors: Alexandros Kalousis, CJ Burges, D Cai, J Fürnkranz, JH Friedman, Jun Wang, M Hilario, N Srebro, Phong Nguyen
Publication date: 04/11/2015

Recommendation systems often rely on point-wise loss metrics such as the mean squared error. However, in real recommendation settings only a few items are presented to a user. This observation has recently encouraged the use of rank-based metrics. LambdaMART is the state-of-the-art algorithm in learning to rank that relies on such a metric. Despite its success, it does not have a principled regularization mechanism; it relies instead on empirical approaches to control model complexity, which leaves it prone to overfitting. Motivated by the fact that very often the users' and items' descriptions, as well as the preference behavior, can be well summarized by a small number of hidden factors, we propose a novel algorithm, LambdaMART Matrix Factorization (LambdaMART-MF), that learns a low-rank latent representation of users and items using gradient boosted trees. The algorithm factorizes LambdaMART by defining relevance scores as the inner product of the learned representations of the users and items. The low rank is essentially a model complexity controller; on top of it we propose additional regularizers that constrain the learned latent representations to reflect the user and item manifolds, as these are defined by their original feature-based descriptors and the preference behavior. Finally, we also propose to use a weighted variant of NDCG to reduce the penalty for similar items with large rating discrepancy. We experiment on two very different recommendation datasets, meta-mining and movies-users, and evaluate the performance of LambdaMART-MF, with and without regularization, in the cold start setting as well as in the simpler matrix completion setting. In both cases it significantly outperforms current state-of-the-art algorithms.
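The two ingredients named in this abstract, relevance scores defined as inner products of latent vectors and a rank-based metric, can be illustrated with a minimal sketch. It is not the authors' implementation: the latent vectors below are made-up numbers rather than outputs of gradient boosted trees, the function names are hypothetical, and the metric is the standard NDCG with exponential gain, not the weighted variant the paper proposes.

```python
import math

def relevance_scores(user_vec, item_vecs):
    """Score each item as the inner product of the user and item latent vectors."""
    return [sum(u * v for u, v in zip(user_vec, item)) for item in item_vecs]

def dcg_at_k(relevances, k):
    """Discounted cumulative gain over the first k ranked positions."""
    return sum((2 ** rel - 1) / math.log2(pos + 2)
               for pos, rel in enumerate(relevances[:k]))

def ndcg_at_k(scores, true_relevances, k):
    """Standard NDCG@k of the ranking induced by sorting items by score."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    ranked = [true_relevances[i] for i in order]
    ideal_dcg = dcg_at_k(sorted(true_relevances, reverse=True), k)
    return dcg_at_k(ranked, k) / ideal_dcg if ideal_dcg > 0 else 0.0

# Hypothetical rank-2 latent representations (illustrative values only).
user = [1.0, 0.5]
items = [[0.2, 0.1], [0.9, 0.4], [0.5, 0.5]]
scores = relevance_scores(user, items)

# The scores rank item 1 first, then item 2, then item 0.
perfect = ndcg_at_k(scores, [0, 2, 1], k=3)   # ranking agrees with the labels
imperfect = ndcg_at_k(scores, [2, 0, 1], k=3) # best item is ranked last
```

Because the metric depends only on the order induced by the scores, any factorization that preserves the ordering of the inner products yields the same NDCG.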

arXiv.org e-Print Archive

Crossref

Hes-so: ArODES Open Archive (University of Applied Sciences and Arts Western Switzerland / Haute école spécialisée de Suisse occidentale / FH Westschweiz)

Archive ouverte UNIGE

Efficient AUC Optimization for Information Ranking Applications

Authors: C Cortes, C Manning, CJ Burges, Q Wu, T Calders, T Fawcett, T Qin
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2016

Adequate evaluation of an information retrieval system to estimate future performance is a crucial task. Area under the ROC curve (AUC) is widely used to evaluate the generalization of a retrieval system. However, the objective function optimized in many retrieval systems is the error rate and not the AUC value. This paper provides an efficient and effective non-linear approach to optimize AUC using additive regression trees, with a special emphasis on the use of multi-class AUC (MAUC) because multiple relevance levels are widely used in many ranking applications. Compared to a conventional linear approach, the performance of the non-linear approach is comparable on binary-relevance benchmark datasets and is better on multi-relevance benchmark datasets. Comment: 12 pages
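To make the difference between AUC and error rate concrete, the sketch below computes binary AUC directly from its pairwise definition: the fraction of (relevant, non-relevant) pairs that the scores order correctly, with ties counted as half. It only evaluates AUC; it does not reproduce the tree-based optimization or the multi-class MAUC described in the abstract, and the function name is hypothetical.

```python
def pairwise_auc(scores, labels):
    """Binary AUC as the fraction of correctly ordered positive/negative pairs."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    # A pair counts 1 if the positive outranks the negative, 0.5 on a tie.
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Four documents: two relevant (label 1), two non-relevant (label 0).
auc = pairwise_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

In this example three of the four positive/negative pairs are ordered correctly. The quadratic loop is meant for clarity; a sort-based computation gives the same value more efficiently on large datasets.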

arXiv.org e-Print Archive

Crossref

People Detection and Pose Classification Inside a Moving Train Using Computer Vision

Authors: C Coniglio, CJ Burges, S Nowozin, X Zeng
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

This paper was presented at the 5th International Visual Informatics Conference (IVIC 2017). Also part of the Image Processing, Computer Vision, Pattern Recognition, and Graphics book sub series (LNIP, volume 10645). The use of surveillance video cameras in public transport is increasingly regarded as a solution to control vandalism and emergency situations. The widespread use of cameras brings in the problem of managing high volumes of data, resulting in pressure on people and resources. We illustrate a possible step to automate the monitoring task in the context of a moving train (where popular background removal algorithms will struggle with rapidly changing illumination). We looked at the detection of people in three possible postures: Sat down (on a train seat), Standing and Sitting (halfway between sat down and standing). We then use the popular Histogram of Oriented Gradients (HOG) descriptor to train Support Vector Machines to detect people in any of the predefined postures. As a case study, we use the public BOSS dataset. We show different ways of training and combining the classifiers, obtaining a sensitivity improvement of about 12% when using a combination of three SVM classifiers instead of a global (all classes) classifier, at the expense of an increase of 6% in false positive rate. We believe this is the first set of public results on people detection using the BOSS dataset, so that future researchers can use our results as a baseline to improve upon. The work described here was carried out as part of the OBSERVE project funded by the Fondecyt Regular Program of Conicyt (Chilean Research Council for Science and Technology) under grant no. 1140209. S.A. Velastin is grateful for funding received from the Universidad Carlos III de Madrid, the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no. 600371, the Ministerio de Economía y Competitividad (COFUND2013-51509) and Banco Santander.

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Analyzing First-Person Stories Based on Socializing, Eating and Sedentary Patterns

Authors: A Cartas, A Natekin, A Torralba, AR Doherty, BC Russell, CJ Burges, E Talavera, F Pedregosa, M Bolanos, M Dimiccoli, N Srivastava, O Kramer, O Russakovsky
Publication date: 25/07/2017

First-person stories can be analyzed by means of egocentric pictures acquired throughout the whole active day with wearable cameras. This manuscript presents an egocentric dataset with more than 45,000 pictures from four people in different environments such as working or studying. All the images were manually labeled to identify three patterns of interest regarding people's lifestyle: socializing, eating and sedentary. Additionally, two different approaches are proposed to classify egocentric images into one of the 12 target categories defined to characterize these three patterns. The approaches are based on machine learning and deep learning techniques, including traditional classifiers and state-of-the-art convolutional neural networks. The experimental results obtained when applying these methods to the egocentric dataset demonstrated their adequacy for the problem at hand. Comment: Accepted at First International Workshop on Social Signal Processing and Beyond, 19th International Conference on Image Analysis and Processing (ICIAP), September 201

arXiv.org e-Print Archive

Crossref

Aiding first incident responders using a decision support system based on live drone feeds

Authors: A Banerjee, Anna C Schapiro, CJ Burges, CM Bishop, GS Dotson, J Fürnkranz, N Cristianini, OT Yildiz, R Chen, SB Kotsiantis, SK Murthy, TS Lim
Publication venue: Springer Singapore
Publication date: 01/01/2018

In case of a dangerous incident, such as a fire, a collision or an earthquake, a lot of contextual data is available to the first incident responders handling the incident. Based on this data, a commander on scene or dispatchers need to make split-second decisions to get a good overview of the situation and to avoid further injuries or risks. Therefore, we propose a decision support system that can aid incident responders on scene in prioritizing the rescue efforts that need to be addressed. The system collects relevant data from a custom-designed drone by detecting objects such as firefighters, fires, victims, fuel tanks, etc. The drone autonomously observes the incident area and, based on the detected information, proposes to incident responders an action list prioritized by, e.g., urgency or danger.

Crossref

Ghent University Academic Bibliography

Supersymmetric Vacua in Random Supergravity

Authors: A Aazami, A Achucarro, A Altland, A Edelman, B de Wit, CJ Burges, D Marsh, David Marsh, DS Dean, E Katzav, F Denef, F Dyson, G Borot, HP Nilles, J Wishart, L Erdős, Liam McAllister, N Metropolis, P Breitenlohner, P Vivo, RA Horn, S Kachru, T Tao, Thomas C. Bachlechner, Timm Wrase, VA Marčenko, X Chen
Publication venue: Springer Science and Business Media LLC
Publication date: 11/07/2012

We determine the spectrum of scalar masses in a supersymmetric vacuum of a general N=1 supergravity theory, with the Kähler potential and superpotential taken to be random functions of N complex scalar fields. We derive a random matrix model for the Hessian matrix and compute the eigenvalue spectrum. Tachyons consistent with the Breitenlohner-Freedman bound are generically present, and although these tachyons cannot destabilize the supersymmetric vacuum, they do influence the likelihood of the existence of an 'uplift' to a metastable vacuum with positive cosmological constant. We show that the probability that a supersymmetric AdS vacuum has no tachyons is formally equivalent to the probability of a large fluctuation of the smallest eigenvalue of a certain real Wishart matrix. For normally-distributed matrix entries and any N, this probability is given exactly by P = exp(-2N^2|W|^2/m_{susy}^2), with W denoting the superpotential and m_{susy} the supersymmetric mass scale; for more general distributions of the entries, our result is accurate when N >> 1. We conclude that for |W| \gtrsim m_{susy}/N, tachyonic instabilities are ubiquitous in configurations obtained by uplifting supersymmetric vacua. Comment: 26 pages, 6 figures
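For readability, the probability quoted in this abstract can be typeset as follows. This only restates the formula and the threshold given in the text, using the same symbols; no step of the paper's derivation is reproduced.

```latex
\[
  P \;=\; \exp\!\left(-\frac{2N^{2}\,|W|^{2}}{m_{\mathrm{susy}}^{2}}\right),
  \qquad
  |W| \gtrsim \frac{m_{\mathrm{susy}}}{N}
  \;\Longrightarrow\;
  P \lesssim e^{-2}.
\]
```

At the threshold |W| = m_{susy}/N the exponent equals -2, so P = e^{-2}, roughly 0.135, and P falls exponentially as |W| grows beyond that value. This is consistent with the abstract's statement that tachyons become ubiquitous in that regime.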

arXiv.org e-Print Archive

Crossref

A Machine Learning Trainable Model to Assess the Accuracy of Probabilistic Record Linkage

Authors: CJ Burges, DF Williamson, DG Altman, DP da Silveira, HB Newcombe, IP Fellegi, JH Friedman, L Breiman, LE Raileanu, LR Dice, M Tromp, P Christen, RS Michalski, SJ Press, VI Levenshtein, X Meng, Y Siegert
Publication venue: 19th International Conference on Big Data Analytics and Knowledge Discovery (DaWaK)
Publication date: 03/08/2017

Record linkage (RL) is the process of identifying and linking data that relates to the same physical entity across multiple heterogeneous data sources. Deterministic linkage methods rely on the presence of common uniquely identifying attributes across all sources, while probabilistic approaches use non-unique attributes and calculate similarity indexes for pairwise comparisons. A key component of record linkage is accuracy assessment: the process of manually verifying and validating matched pairs to further refine linkage parameters and increase overall effectiveness. This process, however, is time-consuming and impractical when applied to large administrative data sources where millions of records must be linked. Additionally, it is potentially biased, as the gold standard used is often the reviewer’s intuition. In this paper, we present an approach for assessing and refining the accuracy of probabilistic linkage based on different supervised machine learning methods (decision trees, naïve Bayes, logistic regression, random forest, linear support vector machines and gradient boosted trees). We used data sets extracted from huge Brazilian socioeconomic and public health care data sources. These models were evaluated using receiver operating characteristic plots, sensitivity, specificity and positive predictive values collected from a 10-fold cross-validation method. Results show that logistic regression outperforms other classifiers and enables the creation of a generalized, very accurate model to validate linkage results.
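The evaluation measures named in this abstract follow directly from the confusion counts of a classifier that labels candidate pairs as match or non-match. The sketch below shows the standard definitions; the function name and the counts are illustrative assumptions, not figures from the paper.

```python
def linkage_metrics(tp, fp, tn, fn):
    """Sensitivity, specificity and positive predictive value from confusion counts."""
    return {
        # Share of true matches that the classifier accepts.
        "sensitivity": tp / (tp + fn),
        # Share of true non-matches that the classifier rejects.
        "specificity": tn / (tn + fp),
        # Share of accepted pairs that are true matches.
        "ppv": tp / (tp + fp),
    }

# Hypothetical counts for one cross-validation fold.
metrics = linkage_metrics(tp=80, fp=10, tn=90, fn=20)
```

In a 10-fold cross-validation these values would be computed per fold and then summarized, for example by averaging.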

Crossref

UCL Discovery

Intelligent OS X malware threat detection with code inspection

Authors: A Case, A Fattori, A Feizollah, A Mohaisen, A Shabtai, B Scholkopf, CJ Burges, G Suarez-Tangil, GG Richard III, J Gardiner, K Shaerpour, M Nauman, M Sun, N Nissim, NV Chawla, P Faruki, RJ Mangialardo, S Garcia, S Huda, SY Yerima, Z Zhu
Publication venue: Springer Science and Business Media LLC
Publication date: 20/10/2017

With the increasing market share of the Mac OS X operating system, there is a corresponding increase in the number of malicious programs (malware) designed to exploit vulnerabilities on Mac OS X platforms. However, existing manual and heuristic OS X malware detection techniques are not capable of coping with such a high rate of malware. While machine learning techniques offer promising results in automated detection of Windows and Android malware, there have been limited efforts to extend them to OS X malware detection. In this paper, we propose a supervised machine learning model. The model applies a kernel-based Support Vector Machine (SVM) and a novel weighting measure based on application library calls to detect OS X malware. For training and evaluating the model, a dataset combining 152 malware and 450 benign samples was created. Using common supervised machine learning algorithms on the dataset, we obtain over 91% detection accuracy with a 3.9% false alarm rate. We also utilize the Synthetic Minority Over-sampling Technique (SMOTE) to create three synthetic datasets with different distributions, based on the refined version of the collected dataset, to investigate the impact of different sample sizes on the accuracy of malware detection. Using the SMOTE datasets we could achieve over 96% detection accuracy and a false alarm rate of less than 4%. All malware classification experiments are tested using cross-validation. Our results show that increasing the sample size in synthetic datasets has a direct positive effect on detection accuracy, while also increasing the false alarm rate compared with the original dataset.

University of Salford Institutional Repository

Crossref

White Rose Research Online

Predicting a small molecule-kinase interaction map: A machine learning approach

Authors: C Hansch, C Helma, CJ Burges, CW Yap, DJ Hand, E Engvall, Fabian Buchwald, H Briem, H Mannila, HJ Böhm, JD Thompson, KR Müller, Lothar Richter, M Hall, MA Fabian, MW Karaman, N Weill, OV Buzko, R Agrawal, R Quinlan, S Keerthi, SB Needleman, SM LaValle, Stefan Kramer, U Rückert, X Xia
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: We present a machine learning approach to the problem of protein ligand interaction prediction. We focus on a set of binding data obtained from 113 different protein kinases and 20 inhibitors. It was attained through ATP site-dependent binding competition assays and constitutes the first available dataset of this kind. We extract information about the investigated molecules from various data sources to obtain an informative set of features. Results: A Support Vector Machine (SVM) as well as a decision tree algorithm (C5/See5) is used to learn models based on the available features which in turn can be used for the classification of new kinase-inhibitor pair test instances. We evaluate our approach using different feature sets and parameter settings for the employed classifiers. Moreover, the paper introduces a new way of evaluating predictions in such a setting, where different amounts of information about the binding partners can be assumed to be available for training. Results on an external test set are also provided. Conclusions: In most of the cases, the presented approach clearly outperforms the baseline methods used for comparison. Experimental results indicate that the applied machine learning methods are able to detect a signal in the data and predict binding affinity to some extent. For SVMs, the binding prediction can be improved significantly by using features that describe the active site of a kinase. For C5, besides diversity in the feature set, alignment scores of conserved regions turned out to be very useful.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gutenberg Open

Rational Design of Temperature-Sensitive Alleles Using Computational Structure Prediction

Authors: B Cunningham, B Lee, C Cortes, Ca Rohl, Christopher S. Poultney, CJ Burges, David Gresham, Dennis E. Shasha, EH Kellogg, G Chakshusmathi, Glenn L. Butterfoss, HM Muller, JM Word, JR Quinlan, K Bajaj, K Drew, KD Pruitt, Kevin Drew, Kristin C. Gunsalus, M Hall, Michelle R. Gutwein, N Eswar, N Siew, R Varadarajan, Richard Bonneau, RJ Dohmen, S Tweedie, SF Altschul, TW Harris, Vladimir N. Uversky, WS Noble, WS Sandberg
Publication venue: Public Library of Science
Publication date: 02/09/2011

Temperature-sensitive (ts) mutations are mutations that exhibit a mutant phenotype at high or low temperatures and a wild-type phenotype at normal temperature. Temperature-sensitive mutants are valuable tools for geneticists, particularly in the study of essential genes. However, finding ts mutations typically relies on generating and screening many thousands of mutations, which is an expensive and labor-intensive process. Here we describe an in silico method that uses Rosetta and machine learning techniques to predict a highly accurate “top 5” list of ts mutations given the structure of a protein of interest. Rosetta is a protein structure prediction and design code, used here to model and score how proteins accommodate point mutations with side-chain and backbone movements. We show that integrating Rosetta relax-derived features with sequence-based features results in accurate temperature-sensitive mutation predictions.

Public Library of Science (PLOS)

Crossref

PubMed Central